System, method and mobile unit to sense objects or text and retrieve related information

ABSTRACT

A system, method and mobile unit to sense objects and/or text and retrieve related images, text or translations. The system may allow users to instantly access information about an object or text with a mobile device, such as a mobile phone, pager, personal digital assistant (PDA) or laptop computer.

CLAIM OF PRIORITY UNDER 35 USC

The present application claims priority to co-owned U.S. Provisional Application No. 60/616,256, filed on Oct. 7, 2004, entitled “SYSTEM, METHOD AND MOBILE UNIT TO SENSE OBJECTS OR TEXT AND RETRIEVE IMAGES, TEXT OR TRANSLATIONS,” which is hereby incorporated by reference in its entirety.

BACKGROUND

Some mobile phones, which may be called camera phones, currently have built-in cameras. These camera phones allow a user to transmit images to other devices through a communication system.

SUMMARY

A system, method and mobile unit are described to sense objects and/or text and retrieve information related to the object or text, such as images, video, text or language translations. The system may allow users to instantly view or access information about an object or text with a mobile device, such as a mobile phone, pager, personal digital assistant (PDA) or laptop computer.

The system may be faster and more accurate than a system that only uses human searchers to find information related to an object or text. The system may use human searchers in addition to computer searchers.

Using images to identify objects or text may be faster and more accurate than a system that uses speech recognition to describe and identify images or text because images may be more precise than words. As the saying goes, a picture may be worth a thousand words.

DESCRIPTION OF FIGURES

FIG. 1 illustrates a mobile unit, a communication system, a server, the Internet and storage units.

FIG. 2 illustrates a method of using the mobile unit, communication system and server of FIG. 1.

FIG. 3 illustrates a method of using the mobile unit of FIG. 1.

FIG. 4 illustrates an example of information that may be displayed on the mobile unit of FIG. 1.

DETAILED DESCRIPTION

Communication System

The communication system or network 140 may comprise any type of components configured to provide wireless communication with one or more mobile units 100. For example, the communication system 140 may comprise one or more base stations (BSs) 141 (also called base station transceivers), base station controllers (BSCs) 142, mobile switching centers (MSCs) 143, packet data switching networks (PDSNs) 144, Internet servers 145, storage units, etc. The communication system 140 may use a multiple access technology, such as time division multiple access (TDMA), code division multiple access (CDMA), wideband CDMA (WCDMA), Global System for Mobile Communications (GSM), time-division synchronous CDMA (TD-SCDMA), etc. Alternatively, the communication system 140 may comprise “hotspots” (transceivers) for WiFi or WiMax. The communication system 140 may use communication protocols, such as packet data transmission protocols.

Mobile Unit

The mobile unit 100 may be a mobile phone, a pager, a personal digital assistant (PDA), a laptop computer, a digital camera or some other mobile device. The mobile unit 100 may be a commercially available device, such as a camera phone made by Nokia, Motorola, Samsung, LG Electronics, Ericsson, Kyocera, etc. Alternatively, the mobile unit 100 may be a modified commercially available device, e.g., a camera phone with added software 107. Alternatively, the mobile unit 100 may be a new mobile device designed to implement some or all of the functions described herein.

The mobile unit 100 may comprise a sensor 102, a processor 104, a memory 106, software 107, a database 116 of information, such as images, videos, text files and other data, a transceiver 108, a speaker 110, a microphone 112, a display 114, a keypad or other user input component 116, and an optional position location unit 118. The mobile unit 100 may include other components in addition to or instead of the components listed herein. The mobile unit 100 may include fewer than all of the components shown in FIG. 1. Some of the components in FIG. 1 may be integrated.

The keypad 116 may include one or more keys or buttons on various sides (front, rear, top, back, lateral, etc.) of the mobile unit 100; the keys or buttons are not limited to any one side. The keys or buttons are also not limited to any material, arrangement or structure.

The sensor 102 may take or capture a snapshot image (picture) or a video of an object or text. The sensor 102 may be similar to standard image sensors in currently available camera phones, or the sensor 102 may be a specially designed image sensor. The sensor 102 may have a photodetector, such as a charge-coupled device (CCD) and/or complementary metal oxide semiconductor (CMOS) circuit. The sensor 102 may have a user-controlled zoom function or image magnification with one of the keys of the keypad 116. Alternatively, the zoom function or image magnification may be automatically controlled by the mobile unit 100. The sensor 102 may have a user-controlled or automatic focus function to focus on one or more objects.

User Commands—by Voice or Keypad

For any user-initiated functions on the mobile unit 100 described herein, the user may press one or more keys on the keypad 116 or speak a command into the microphone 112. For example, the user may press one or more keys on the keypad 116 and/or speak a command into the microphone 112 to activate the sensor 102, which senses or captures an image or picture of an object or text.

Examples of objects may include a car, a building, a monument, a person, an animal, a plant, a piece of clothing, jewelry, a poster, a billboard, a sign, an image on a TV, computer display or other display, a comic book, a stamp, a coin or other collectible. Examples of people include professional athletes, sports teams, musicians, actors/actresses, celebrities, politicians, etc.

Examples of text may include text on a sign, an address, a business card, a book, a billboard, a poster, etc. The sensor 102 and/or the processor 104 may convert the image of an object or text into data, such as digital data, e.g., a sequence of bits.

Before, during or after the sensor 102 is activated to capture an image of an object or text, the user may enter one or more commands via one or more keys on the keypad 116, the display 114 or microphone 112. Examples of user commands may include “find information related to object,” “find info,” “find definition,” “find photo,” “find video,” “find related images,” “find news,” “find history,” “find story,” “find product,” “buy,” “sell,” “find best deal,” “find nearest store,” “find seller,” “find person,” “find web site,” “find email,” “find address,” “find phone number,” “find nearest location,” “get map,” “find directions,” “translate,” and “translate in <language X>.”

The user may enter at least one search word via one or more keys on the keypad 116 or say a word or phrase into the microphone 112. The search word(s) may help identify the object and/or narrow a search to find specific information related to the object. For example, the search word may be a name of the object or what the user believes is a name or brief description of the object.

In one configuration, the processor 104 may execute software 107 to search the database 116 and identify the object or text, retrieve related information, and display information on the display 114 or play audio files through the speaker 110.

In another configuration, the processor 104 may cause the transceiver 108 to transmit the image data and a user command to the server 150 via the communication system 140. The user command may be represented by a short sequence of bits. This configuration may save processing power and memory space on the mobile unit 100 compared to the configuration described above. The server 150 may process the command and send requested data to the mobile unit 100.
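
As one illustration of representing a user command by a short sequence of bits, the following sketch packs a command into a single byte placed ahead of the image data. The command codes, the one-byte layout and the names Command, build_request and parse_request are assumptions chosen for illustration; no particular encoding is specified herein.

```python
from enum import IntEnum
from typing import Tuple


class Command(IntEnum):
    """Hypothetical one-byte codes for a few of the user commands."""
    FIND_INFO = 1
    FIND_PHOTO = 2
    FIND_NEWS = 3
    BUY = 4
    TRANSLATE = 5


def build_request(command: Command, image_data: bytes) -> bytes:
    """Prefix the image data with a single command byte (assumed layout)."""
    return bytes([command]) + image_data


def parse_request(payload: bytes) -> Tuple[Command, bytes]:
    """Split a payload back into its command and its image data."""
    if not payload:
        raise ValueError("empty payload")
    return Command(payload[0]), payload[1:]
```

With such a layout, the command may add only one byte to each transmission, and the server 150 may recover it with parse_request before processing the image data.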

In another configuration, the processor 104 first executes software 107 to search the database 116 and try to identify the object or text. If the object or text cannot be identified, e.g., within a time period, the processor 104 then causes the transceiver 108 to transmit the image data and user command to the communication system 140.
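
The local-first configuration may be sketched as follows. The function names, the two-second default time period and the convention that the local search returns None when it cannot identify the object before the deadline are all assumptions made for illustration.

```python
import time
from typing import Callable, Optional


def identify(image_data: bytes,
             search_local: Callable[[bytes, float], Optional[str]],
             send_to_server: Callable[[bytes], str],
             time_limit_s: float = 2.0) -> str:
    """Try the on-device database first, then fall back to the server.

    `search_local` receives the image data and a monotonic-clock deadline
    and is assumed to return None if it cannot identify the object in time.
    """
    deadline = time.monotonic() + time_limit_s
    result = search_local(image_data, deadline)
    if result is not None:
        return result
    # Local identification failed or timed out: transmit to the server.
    return send_to_server(image_data)
```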

The mobile unit 100 may automatically or upon a user input (e.g., the user may dial a phone number, press one or more buttons on the keypad 116 or speak into the microphone 112) send the image data and user command to the communication system 140, which sends the image data and user command to the server 150. For example, the mobile unit 100 may send the image data to a base station 141, which sends the data to a base station controller (BSC) 142, which sends the data to a mobile switching center (MSC) 143, which sends the data to a packet data switching network (PDSN) 144, which sends the data to the server 150. Other communication routes between the mobile unit 100 and server 150 may involve more components, fewer components, or other components of the communication system 140.

Display

FIG. 4 illustrates an example of some information that may be displayed on the display 114. Not all of the information need be displayed at one time on one screen. Some of the information may be displayed on different screens of the display 114. The screens may be switched or scrolled by user commands. The display 114 may display one or more pieces of information according to when user commands are received by the mobile unit 100. For example, the display 114 may show an image 402 when the user activates the sensor 102. The display 114 may show a search word if the user enters a search word. The display 114 may show an object name and/or related image or video 404 after a search has occurred for the image 402.

As shown in FIG. 4, the display 114 may show information “options” (also called categories or types) related to the object, such as buy, sell, recent news, history or background, statistics, etc. These options may be displayed before a search and/or after a search. Displaying the options before a search may allow the user to select one or more categories of information and help narrow a search before the search occurs. A narrower search may provide one or more advantages, such as reducing search time, processing power, processing time, memory at the mobile unit 100 or server 150 to store search results, bandwidth and/or capacity of the communication system 140, mobile unit 100 and/or server 150, etc.

In addition to or instead of displaying these options before a search, the display 114 may show the options after a search occurs. These options allow the user to select one or more categories of information to display or audio or video files to play after a search. These options after a search may provide one or more advantages, such as reducing processing power, processing time, memory at the mobile unit 100 or server 150 to store search results, bandwidth and/or capacity of the communication system 140, mobile unit 100 and/or server 150, etc. Without these options, a large amount of bandwidth and capacity may be consumed to retrieve information for display or playback on the mobile unit 100.

The “buy” option may allow a user to buy an object in the image 402 or a product related to an object in the image 402. The buy option may be linked to a specific seller's web site or a web site such as Amazon.com or eBay. For example, if the image 402 shows a sports team, a musician or a movie, the buy option may retrieve product links related to the sports team, musician or movie, such as T-shirts, hats, other clothes, posters, compact discs, etc.

One or more options may be user-configurable or capable of being modified by the user. For example, the user may specify a local or specific region for buying or selling an object in the image 402. Examples of regions may include states, such as Virginia, Maryland or California, east coast, west coast, midwest, south, or one or more countries. The user may specify a time period for retrieving recent news such as within the last X days or Y weeks or Z months, etc. The user may specify a type of history, background or statistics. The order of the options displayed on the display 114 may also be configured or modified by the user.
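
User-configurable options of this kind may be held in a small settings record, as in the following sketch. The class and field names (SearchOptions, NewsItem, news_max_age_days) are hypothetical, and only the time-period option for recent news is applied here.

```python
from dataclasses import dataclass, field
from datetime import date, timedelta
from typing import List, Optional


@dataclass
class SearchOptions:
    """Hypothetical container for user-configured search options."""
    # Order of this list is the order in which options would be displayed.
    categories: List[str] = field(default_factory=lambda: ["buy", "recent news"])
    region: Optional[str] = None            # e.g. "Virginia" or "west coast"
    news_max_age_days: Optional[int] = None  # None means no time limit


@dataclass
class NewsItem:
    title: str
    published: date


def filter_recent(items: List[NewsItem], options: SearchOptions,
                  today: date) -> List[NewsItem]:
    """Keep only news items inside the user's configured time period."""
    if options.news_max_age_days is None:
        return list(items)
    cutoff = today - timedelta(days=options.news_max_age_days)
    return [item for item in items if item.published >= cutoff]
```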

As shown in FIG. 4, the user may enter or select the user's own option or category of information to retrieve related to the image 402.

Server

The computer or server 150 may be a single unit or multiple units, such as a distributed computer system. The server 150 may be at a single location or multiple locations. The server 150 may be coupled to the Internet 180 and/or data storage units 190.

The server 150 may have software, hardware or a combination of software and hardware to implement the functions described herein. The server 150 may include other components in addition to or instead of the components listed herein. Some or all of these components or functions may be implemented in the mobile unit 100.

The server 150 may include a transceiver 151, processor 152, a memory 154, a database 156, a search engine 158, an image processing module 160, an object identifier 161, a text recognition module 162, a language translation module 164, an information blocker 166, etc. These modules are shown and described herein as separate modules, but one or more of these modules may be combined or integrated. Although shown in the memory 154, the search engine 158, image processing module 160, object identifier 161, text recognition module 162, language translation module 164 and information blocker 166 may be implemented in whole or in part with hardware, such as application specific integrated circuits (ASICs).

The server 150 may receive image data and one or more user commands from the mobile unit 100, retrieve related data/information from the Internet 180 and/or data storage units 190, such as images, video and/or text (described below), and send the related data to the mobile unit 100 via the communication system 140. The server 150 may perform these functions in real time. The mobile unit 100 and server 150 may act as a quick dictionary, encyclopedia, reference source, or link to related information on the Internet 180.

The database 156 may store data, such as image files, video files and/or text files, which may be transmitted by the transceiver 151 and communication system 140 to the mobile unit 100. The database 156 may also store text data in more than one language.

Search Engine

The search engine 158 may search for web sites, hypertext links, text, and other data or structures on the Internet 180 that match or are related to the image or text from the mobile unit 100. The search engine 158 may be similar to search engines used by Yahoo or Google, or may be a modified version of such search engines.

Image Processing Module

The image processing module 160 may process the image from the mobile unit 100 to clarify an object or text. For example, the image processing module 160 may enhance or focus one or more parts of an image, filter or remove non-relevant objects (such as shadows, rain, snow), increase or decrease contrast between light and dark areas, adjust brightness, adjust color, adjust focus, interpolate pixels to reduce the effects of blurs or reflections, remove red eye, etc. Other image processing functions may be known in the camera, video and image processing art, but combining image processing with other functions described herein is not known.
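
Two of the listed operations, contrast and brightness adjustment, may be illustrated with the following minimal sketch. It assumes an 8-bit grayscale image stored as rows of integers and a linear adjustment about mid-gray; both the representation and the function name adjust are assumptions, and an actual image processing module 160 may use other techniques.

```python
from typing import List

Image = List[List[int]]  # rows of 8-bit grayscale pixel values (0-255)


def adjust(image: Image, contrast: float = 1.0, brightness: int = 0) -> Image:
    """Scale each pixel about mid-gray (128), shift it, and clamp to 0-255."""
    def fix(pixel: int) -> int:
        value = round((pixel - 128) * contrast + 128 + brightness)
        return max(0, min(255, value))

    # Return a new image; the input is left unmodified.
    return [[fix(pixel) for pixel in row] for row in image]
```

A contrast factor greater than 1.0 may widen the gap between light and dark areas, while a factor below 1.0 may narrow it.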

Object Identifier

The object identifier 161 may have one or more modules, which may be separate or integrated. The object identifier 161 may try to identify or recognize one or more objects of the image from the image processing module 160. The object identifier 161 may use the Internet 180 or the database 156 to try to find objects or information (images, videos, text) that match or are similar to the object in the image from the mobile unit 100.

The object identifier 161 may use object parameters such as shape (e.g., round, rectangular, oval, angular, sharp edges, curved edges, human face, shape of car, shape of commercial product), color, color pattern (e.g., monotone, spotted, striped, number of colors), surface texture (e.g., smooth, rough, reflective, furry, bumpy), material composition (e.g., metallic, wood, rock, plastic), arrangement, structure, facial features, spacing between lines or regions, size of an object relative to other objects, etc.
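
One simple way to compare such object parameters is to count how many of them agree with a stored entry, as in the following sketch. The parameter names, the 0.5 threshold and the functions similarity and best_match are hypothetical; the object identifier 161 is not limited to this scoring.

```python
from typing import Dict, Optional

Params = Dict[str, str]  # e.g. {"shape": "round", "material": "metallic"}


def similarity(query: Params, candidate: Params) -> float:
    """Fraction of the query's parameters whose values match the candidate."""
    if not query:
        return 0.0
    matches = sum(1 for key, value in query.items() if candidate.get(key) == value)
    return matches / len(query)


def best_match(query: Params, catalog: Dict[str, Params],
               threshold: float = 0.5) -> Optional[str]:
    """Return the best-scoring catalog entry, or None if below the threshold."""
    best_name, best_score = None, 0.0
    for name, params in catalog.items():
        score = similarity(query, params)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None
```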

The object identifier 161 may use various levels of object classification to help identify an object. For example, one object classification may be whether the object is a living organism, such as a person, animal, plant, fungus, etc., or a non-living object, such as a rock, vehicle, building, statue, sign, etc. Living organisms may further be classified as a person, animal, plant or fungus. People may be classified as professional athletes, sports teams, musicians, actors/actresses, celebrities, politicians, etc. Non-living objects may further be classified as man-made or natural. Another object classification may be whether the object is inanimate. Another object classification may be whether the object is mechanical or electrical.
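
The levels of classification may be represented as a tree in which each class points to its parent, as in the following sketch. The particular classes, the PARENT table and the function classification_path are assumptions drawn loosely from the examples above.

```python
from typing import Dict, List, Optional

# Hypothetical tree: each class maps to its parent class (None for a root).
PARENT: Dict[str, Optional[str]] = {
    "living": None,
    "non-living": None,
    "person": "living",
    "animal": "living",
    "plant": "living",
    "musician": "person",
    "politician": "person",
    "man-made": "non-living",
    "natural": "non-living",
    "vehicle": "man-made",
    "rock": "natural",
}


def classification_path(label: str) -> List[str]:
    """Return the chain of classes from the broadest level down to `label`."""
    path: List[str] = []
    current: Optional[str] = label
    while current is not None:
        path.append(current)
        current = PARENT[current]  # raises KeyError for an unknown class
    return path[::-1]
```

Walking from the broadest class downward may let a search discard whole branches, e.g., all living organisms, before comparing finer parameters.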

Text Recognition Module

The text recognition module 162 may categorize text by language (e.g., English, Chinese, Spanish) and style (e.g., cursive, font, numeric). Some text recognition techniques are known, such as those used in character recognition machines available from Scantron Corporation of Irvine, Calif. Other text recognition techniques may be known in the camera, video and image processing art, but combining text recognition with other functions described herein is not known.
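
Once characters have been recognized, a coarse categorization may be sketched with the Unicode character names in Python's standard unicodedata module. This sketch only separates numeric text, CJK ideographs and Latin-script text; it is an assumption for illustration and cannot by itself distinguish languages that share a script, such as English and Spanish.

```python
import unicodedata


def categorize(text: str) -> str:
    """Coarsely label recognized text as numeric, CJK, Latin, unknown or empty."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return "empty"
    if all(c.isdigit() for c in chars):
        return "numeric"
    counts = {"CJK": 0, "LATIN": 0}
    for c in chars:
        # Unicode names begin with the script, e.g. "LATIN SMALL LETTER A".
        name = unicodedata.name(c, "")
        for script in counts:
            if name.startswith(script):
                counts[script] += 1
    if counts["CJK"] == 0 and counts["LATIN"] == 0:
        return "unknown"
    return "CJK" if counts["CJK"] >= counts["LATIN"] else "Latin"
```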

Examples of Retrieved Images, Videos, Text, Audio Files

The server 150 may transmit some or all of the information related to the original object or text to the mobile unit 100. To conserve communication bandwidth and display space, the server 150 may only send brief titles of information found by the server 150.

The mobile unit 100 may display the information, such as text, images or video, on the display 114, as shown in FIG. 4. To conserve communication bandwidth and display space, the mobile unit 100 may first display brief titles of information found by the server 150. The mobile unit 100 may provide music or audible sounds that describe the object or text, such as a voice recording defining, describing or translating the object or text.
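
The titles-first exchange may be sketched as a two-step lookup: the server first returns only the brief titles, and the full information is sent only for a title the user selects. The class ResultSession and its methods are hypothetical names used for illustration.

```python
from typing import Dict, List


class ResultSession:
    """Hypothetical server-side holder for the results of one search."""

    def __init__(self, results: Dict[str, str]) -> None:
        # Maps each brief title to the full information found for it.
        self._results = dict(results)

    def titles(self) -> List[str]:
        """First response: brief titles only, to conserve bandwidth."""
        return list(self._results)

    def fetch(self, title: str) -> str:
        """Second response: full information for a title the user selected."""
        return self._results[title]
```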

Retrieved images may include a file, a photograph, a drawing, a computer-created graphic, a web site, a link to a web site, a news story or a commercial advertisement, which may be displayed on the mobile unit 100.

Retrieved videos may include a documentary, a news clip, a music video, a biography, a cartoon, an advertisement, etc.

Retrieved text may be a definition, a description of the object, one or more news articles, driving directions, geographic information, an advertisement, a language translation, etc. The display 114 may display a list of found categories of information or web site links, and the user may scroll through the categories or links.

Translation

In one embodiment, the server 150 may provide a language translation of the text, for example, from Chinese to English. Any language may be available to be selected by the user of the mobile unit 100. The mobile unit 100 may allow a user to select a desired language or set a default language for future translations.

Building a Database

The server 150 may build a database 156 of data (images, video files, audio files and text) related to objects and/or text by storing retrieved data from the Internet 180 and/or the storage units 190.
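
Building the database 156 from retrieved data may resemble a cache that stores each result the first time it is fetched, as in the following sketch. The class ResultDatabase, the fetch_remote callable and the use of an object name as the key are assumptions made for illustration.

```python
from typing import Callable, Dict, List


class ResultDatabase:
    """Hypothetical store that keeps data retrieved for each object."""

    def __init__(self, fetch_remote: Callable[[str], List[str]]) -> None:
        self._fetch_remote = fetch_remote  # e.g. an Internet or storage lookup
        self._store: Dict[str, List[str]] = {}
        self.remote_calls = 0

    def lookup(self, object_name: str) -> List[str]:
        """Return stored data, retrieving and storing it on first request."""
        if object_name not in self._store:
            self.remote_calls += 1
            self._store[object_name] = self._fetch_remote(object_name)
        return self._store[object_name]
```

Later requests for the same object may then be answered from the database without a further retrieval.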

Information Blocker

A security or information blocker 166 may block certain information from being transmitted to the mobile unit 100 for privacy or safety reasons. Examples may include names of non-famous people, personal phone numbers, email addresses, residential addresses, and other unauthorized information.
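
For text results, one possible form of blocking is pattern-based redaction, sketched below with Python's standard re module. The two patterns are deliberately simple assumptions (a basic email shape and a ten-digit phone number with separators) and would miss many real formats; names and residential addresses would need other techniques.

```python
import re

# Simplified, assumed patterns; real phone and email formats vary widely.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE = re.compile(r"\b\d{3}[-. ]\d{3}[-. ]\d{4}\b")


def block(text: str, placeholder: str = "[blocked]") -> str:
    """Replace email addresses and phone numbers before transmission."""
    text = EMAIL.sub(placeholder, text)
    return PHONE.sub(placeholder, text)
```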

The functions described above may be implemented by hardware, software or a combination of hardware and software. Hardware may include a processor, a controller, application-specific integrated circuits (ASICs), field programmable gate arrays (FPGAs), etc.

A System, Method and Mobile Unit with a User Identifier and a Scannable Media to Store Credit or Debit Information

The mobile unit 100 may be a cellular phone, a personal digital assistant (PDA), a laptop computer or some other mobile device.

The mobile unit 100 may include one or more ways to identify the user, such as a retina scanner, a fingerprint scanner, a voice scanner, a heart rate or pulse scanner, etc. For highest security, the mobile unit 100 may have more than one way to identify the user. Alternatively, for cost or size concerns, the mobile unit 100 may have only one way to identify the user.

The mobile unit 100 may require a user to identify himself or herself using one or more ways when the user activates the mobile unit 100 to perform one or more of the functions described below.
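
Requiring one or more ways of identification may be sketched as counting how many identification checks pass. The function user_identified, the check names and the stand-in callables are hypothetical; real scanners would replace the callables.

```python
from typing import Callable, Dict

Check = Callable[[], bool]  # returns True if this scan matches the user


def user_identified(checks: Dict[str, Check], required: int = 1) -> bool:
    """Return True once at least `required` identification checks pass."""
    if required < 1:
        raise ValueError("at least one check must be required")
    passed = 0
    for check in checks.values():
        if check():
            passed += 1
            if passed >= required:
                return True  # stop early; remaining scans are not needed
    return False
```

A higher value of required may correspond to the higher-security arrangement, while a value of one may correspond to a mobile unit 100 with a single way to identify the user.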

The mobile unit 100 may provide one or more functions of a person's wallet or purse or contents of a wallet or purse. For example, the mobile unit 100 may serve as a credit card, a debit card, a driver's license, a passport, a membership card, an insurance card, an identification card, a company security badge, a business card holder, a movie rental card, a library card, a health fitness membership card, a purchase receipt holder, etc. Thus, a user does not have to carry a wallet.

Also, a user does not have to worry about the wallet being stolen. If the mobile unit 100 is stolen, it is useless because the thief does not have the retina, fingerprint and/or voice of the user to activate the mobile unit 100.

The mobile unit 100 may have a medium to store credit information, debit information or other information listed above. The medium may be a scannable medium, such as a bar code or a magnetic strip. The medium may be an internal medium, such as a memory and/or an integrated circuit, which is detectable by or adapted to communicate with external scanners or communicators, such as grocery store checkout scanners.

1. A mobile device comprising: a sensor configured to capture an image of an object; a keypad configured to receive a user command to select a category of information related to the object, the information comprising at least one of an image, a video, an audio file, and text; a transceiver configured to (a) wirelessly transmit the image and user command to a communication system and (b) wirelessly receive titles of information within the category of information from the communication system; and a display configured to display the received titles.
 2. The mobile device of claim 1, wherein the mobile device comprises at least one of a mobile phone, a laptop computer, a personal digital assistant and a camera.
 3. The mobile device of claim 1, wherein the category comprises at least one of a definition, a related image, a related video, news, history, purchase information, selling information, location, nearest store, person, web site, email, address, phone number, map, directions, and translation.
 4. The mobile device of claim 1, wherein the titles comprise at least one of links to picture files, links to video files, links to text files, and links to web sites.
 5. The mobile device of claim 1, further comprising a voice recognition unit configured to receive the user command.
 6. The mobile device of claim 5, wherein the voice recognition unit is configured to receive at least one search word from the user, the search word being related to the object.
 7. The mobile device of claim 1, wherein the keypad is configured to receive at least one search word from the user, the search word being related to the object.
 8. The mobile device of claim 1, wherein the keypad is configured to receive a user command selecting a title among the received titles, the display being configured to show information related to the selected title.
 9. The mobile device of claim 1, wherein the display is configured to display a pre-stored list of categories for the user to select, the keypad being configured to receive a user command to select one of the categories.
 10. The mobile device of claim 1, wherein the keypad is configured to receive a user command to configure a category.
 11. A method of operating a mobile device, the method comprising: capturing an image of an object with the mobile device; receiving a user command to select a category of information related to the object, the information comprising at least one of an image, a video, and text; wirelessly transmitting the image and user command to a communication system; wirelessly receiving titles of information within the category of information from the communication system; and displaying the received titles.
 12. The method of claim 11, further comprising receiving at least one search word from the user, the search word being related to the object.
 13. A system comprising: a communication network configured to receive a user command and data representing an image of an object from a mobile device; and a computer configured to (a) receive the user command and data from the communication network, (b) access at least one of a database and the Internet to find information related to the object, the information comprising at least one of an image, a video, an audio file, text, a hypertext link, and a web site, and (c) send titles of the found information to the communication network; wherein the communication network is configured to send the titles to the mobile device.
 14. The system of claim 13, wherein the computer is configured to find a category of information, the category being specified in the user command.
 15. The system of claim 14, wherein the category comprises at least one of a definition, a related image, a related video, news, history, purchase information, selling information, location, nearest store, person, web site, email, address, phone number, map, directions, and translation.
 16. The system of claim 13, wherein the computer is configured to use a search word in the user command to find information related to the object.
 17. The system of claim 13, wherein the computer is configured to use an Internet search engine to find information related to the object.
 18. The system of claim 13, wherein the computer is further configured to (a) receive a second user command specifying one or more titles of found information and (b) send found information of the specified titles to the communication network.
 19. The system of claim 13, wherein the computer is further configured to process the image to clarify the object.
 20. The system of claim 13, wherein the computer is configured to use parameters of the object to find information related to the object, the parameters comprising at least one of shape, features, color, color pattern, surface texture, material composition, arrangement, structure, spacing between lines, and size of the object relative to other objects.
 21. The system of claim 13, wherein the computer is configured to use a plurality of object classifications to find information related to the object.
 22. A method comprising: receiving a user command and data representing an image of an object from a mobile device; accessing at least one of a database and the Internet to find information related to the object, the information comprising at least one of an image, a video, an audio file, text, a hypertext link, and a web site; and sending titles of the found information to the mobile device.
 23. The method of claim 22, further comprising using parameters of the object to find information related to the object, the parameters comprising at least one of shape, features, color, color pattern, surface texture, material composition, arrangement, structure, spacing between lines, and size of the object relative to other objects.